Aerial Photograph Categorization by Cross-resolution Deep Human Gaze Behavior Learning
Authors
Abstract
Accurately recognizing aerial photographs is a useful technique in many domains, such as autonomous driving and environmental evaluation. In practice, both low-resolution and high-resolution photos are captured asynchronously for each region, as there are hundreds of earth observation satellites orbiting the earth. Realizing such multi-resolution-based region recognition is a difficult task due to three challenges: 1) mimicking human visual perception when people actively view semantic objects inside a photo; 2) deeply modeling the visually/semantically salient regions sequentially perceived by the human visual system; and 3) developing a cross-resolution knowledge transferal module to enhance the feature representation of an area. To solve these challenges, we propose a cross-domain aerial photograph categorization system that leverages the spatial composition of, and deeply encodes, the gaze shifting path (GSP) of high-resolution photos. More specifically, we first use an active learning algorithm to discover multiple object patches for constructing the GSP from each photo. Then, an aggregation-based deep model is formulated to link the features learned from the GSP. Subsequently, a novel cross-resolution module leverages the global counterparts to upgrade the deeply learned GSP features. Using the upgraded feature, a multi-label SVM classifier is trained for categorizing aerial photographs. Comparative studies on our million-scale aerial photo set have demonstrated the competitiveness of our approach.
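The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and every name in it is hypothetical: precomputed saliency scores stand in for the active learning step, mean pooling stands in for the aggregation-based deep model, and plain concatenation with a global low-resolution feature stands in for the cross-resolution module.

```python
from typing import List, Sequence, Tuple

# Assumed input format: each patch is (saliency_score, feature_vector).
Patch = Tuple[float, Sequence[float]]


def build_gsp(patches: List[Patch], k: int) -> List[Patch]:
    """Keep the k most salient patches, ordered from most to least salient.

    This ordering is a simple stand-in for a gaze shifting path (GSP).
    """
    return sorted(patches, key=lambda p: p[0], reverse=True)[:k]


def aggregate(gsp: List[Patch]) -> List[float]:
    """Mean-pool the patch features along the path (illustrative only)."""
    if not gsp:
        raise ValueError("the path must contain at least one patch")
    dim = len(gsp[0][1])
    return [sum(p[1][i] for p in gsp) / len(gsp) for i in range(dim)]


def fuse(gsp_feature: Sequence[float], global_feature: Sequence[float]) -> List[float]:
    """Concatenate the GSP feature with a global low-resolution feature."""
    return list(gsp_feature) + list(global_feature)


# Toy data: three patches with 2-D features and a 1-D global feature.
patches = [(0.2, [1.0, 0.0]), (0.9, [0.0, 1.0]), (0.5, [1.0, 1.0])]
gsp = build_gsp(patches, k=2)
feature = fuse(aggregate(gsp), [0.5])
```

In a fuller version, the fused vector would be passed to a multi-label classifier such as a one-vs-rest SVM, which is the final stage the abstract describes.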
Similar Resources
Analyzing Gaze Behavior in Complex (Aerial) Skills
Complex skills with aerial phases, such as somersaults or release-regrasp skills, make up about 45% of all elements in artistic gymnastics. The underlying mechanics of these skills have been studied extensively, and visual information is thought to assist gymnasts in skill performance. However, empirical evidence on the role of gaze behavior and its interplay with movement behavior in complex s...
Coreference Resolution with Deep Learning in the Persian Language
Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...
Deep Learning for Predicting Human Strategic Behavior
Predicting the behavior of human participants in strategic settings is an important problem in many domains. Most existing work either assumes that participants are perfectly rational, or attempts to directly model each participant’s cognitive processes based on insights from cognitive psychology and experimental economics. In this work, we present an alternative, a deep learning approach that ...
Siamese-GAN: Learning Invariant Representations for Aerial Vehicle Image Categorization
In this paper, we present a new algorithm for cross-domain classification in aerial vehicle images based on generative adversarial networks (GANs). The proposed method, called Siamese-GAN, learns invariant feature representations for both labeled and unlabeled images coming from two different domains. To this end, we train in an adversarial manner a Siamese encoder–decoder architecture coupled ...
Super-Resolution via Deep Learning
The recent phenomenal interest in convolutional neural networks (CNNs) must have made it inevitable for the super-resolution (SR) community to explore its potential. The response has been immense and in the last three years, since the advent of the pioneering work, there appeared too many works not to warrant a comprehensive survey. This paper surveys the SR literature in the context of deep le...
Journal
Journal title: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Year: 2022
ISSN: 2151-1535, 1939-1404
DOI: https://doi.org/10.1109/jstars.2022.3179663